On Inference of Partially Correlated Data
نویسندگان
چکیده
the size of each subset. However, in case the missing ness not completely at random (MCAR), Looney and Jones [9] argued that ignoring some of the correlated observations would bias the estimation of the variance of the difference in treatment means and would dramatically affect the performance of the statistical test in terms of controlling type I error rates and statistical power [10]. They propose a corrected z-test method to overcome the challenges created by ignoring some of the correlated observations. However, our preliminary investigation shows that the method of Looney and Jones [9] pertains to large samples and is not the most powerful test procedure. Furthermore, Rempala and Looney [11] studied asymptotic properties of a two-sample randomized test for partially dependent data. They indicated that a linear combination of randomized t-tests is asymptotically valid and can be used for non-normal data. However, the large sample permutation tests are difficult to perform and only have some optimal asymptotic properties in the Gaussian family of distributions when the correlation between the paired observations is positive. Other researchers, such as Xu and Harra [12] and Konietschke et al. [13] also discuss the problem for continuous variables including the normal distribution by using weighted statistics. However, the procedure suggested by Xu and Harra [12] is a functional smoothing to the Looney and Jones [9] procedure. As such, the Xu and Hara procedure is not a practical alternative for the non-statistician researcher. The procedure suggested by Konietschke et al. [13], is a nonparametric procedure based on ranking.
منابع مشابه
A Geometry Preserving Kernel over Riemannian Manifolds
Abstract- Kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...
متن کاملEfficient Estimation in Panel Data Partially Additive Linear Model with Serially Correlated Errors
The partially linear additive model arises in many scientific endeavors. In this paper, we look at inference given panel data and a serially correlated error component structure. By combining polynomial spline series approximation with least squares and the estimation of correlation, we propose a weighted semiparametric least squares estimator (WSLSE) for the parametric components, and a weight...
متن کاملA Pairwise Likelihood Method for Correlated Binary Data With/without Missing Observations under Generalized Partially Linear Single-index Models
Correlated data, such as multivariate or clustered data, arise commonly in practice. Unlike analysis for independent data, valid inference based on such data often requires proper accommodation of complex association structures among response components within clusters. Semiparametric models based on generalized estimating equations (GEE) methods, and their extensions, have become increasingly ...
متن کاملEffect of Relocation and Rotation on Radial Efficiency Scores for a Partially Negative Data Problem
Negative data handling has gained a remarkable importance in the literature of Data Envelopment Analysis (DEA) to address many real life problems. Various erstwhile applications, in this arena, referred relocation of the origin to a superior (RDM) or to an inferior (Translated Input Oriented BCC) neighboring point. In this paper, the conditions for Rotation Invariance of various Data Envelop...
متن کاملVerb Sense And Subcategorization: Using Joint Inference To Improve Performance On Complementary Tasks
We propose a general model for joint inference in correlated natural language processing tasks when fully annotated training data is not available, and apply this model to the dual tasks of word sense disambiguation and verb subcategorization frame determination. The model uses the EM algorithm to simultaneously complete partially annotated training sets and learn a generative probabilistic mod...
متن کامل